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ABSTRACT 

In the last decade, the advent of enormous galaxy surveys has motivated the devel- 
opment of automated morphological classification schemes to deal with large data 
volumes. Existing automated schemes can successfully distinguish between early and 
late type galaxies and identify merger candidates, but are inadequate for studying 
detailed morphologies of red sequence galaxies. To fill this need, we present a new au- 
tomated classification scheme that focuses on making finer distinctions between early 
types roughly corresponding to Hubble types E, SO, and Sa. We visually classify a 
sample of 984 non-starforming SDSS galaxies with apparent sizes > 14". We then de- 
velop an automated method to closely reproduce the visual classifications, which both 
provides a check on the visual results and makes it possible to extend morphologi- 
cal analysis to much larger samples. We visually classify the galaxies into three bulge 
classes (BC) by the shape of the light profile in the outer regions: discs have sharp 
edges and bulges do not, while some galaxies are intermediate. We separately iden- 
tify galaxies with features: spiral arms, bars, clumps, rings, and dust. We find general 
agreement between BC and the bulge fraction B/T measured by the galaxy modeling 
package GIM2D, but many visual discs have B/T > 0.5. Three additional automated 
parameters - smoothness, axis ratio, and concentration - can identify many of these 
high- .B/T discs to yield automated classifications that agree ~ 70 per cent with the 
visual classifications (> 90 per cent within one BC). Tests versus disc inclination in- 
dicate that both methods identify most face-on discs, but visually, features are lost 
in edge-on discs. Eighty per cent of face-on visual discs have features while few visual 
bulges do, strongly validating the visual classifications. Given the good agreement be- 
tween the visual and automated methods, we believe that the automated method can 
be applied to a much larger sample with confidence. Both methods are used to study 
the bulge vs. disc frequency as a function of four measures of galaxy 'size': luminosity, 
stellar mass, velocity dispersion (a), and radius (R). All size indicators show a fall in 
disc fraction and a rise in bulge fraction among larger galaxies. 

Key words: galaxies: evolution - galaxies: structure - galaxies: elliptical and lentic- 
ular - galaxies: bulges 



1 INTRODUCTION 

With l arge-scale surveys like the Sloan Digital Sky Survey 
(SPSS. [York et all 1200(f). studies of galaxy properties are 
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now possible for larger samples than ever before. The SDSS 
has obtained imaging of 11,663 square degrees of sky in five 
bandpasses and spectra of 929,555 galaxies in the local uni- 
verse. Photometry and structural parameters are available 
for 357 million unique objects, and analyses of large sam- 
ples of both spectra and photometry are the topic of cur- 



2 Cheng et al. 



rent research. In the coming years, other large scale surveys, 
including Pan- STARRS^ and LSST0 will be adding to the 
available data. 

The present study is motivated by the desire to bring to 
bear this great wealth of information on the question of the 
star formation histories of galaxies, and in particular how 
some galaxies had their star formation shut down such that 
their star formation rates today are low or zero; morphologi- 
call y most of these a re Es and SOs, hereafter early type galax- 
ies. IStrateraeTaD (|200ll ). in an analysis of 150,000 SDSS 
galaxies, noted that the color distribution of all galaxies is 
distinctly bimodal. At bright magnitudes, galaxies populate 
both a red sequence and blue cloud, whi ch are separated by a 
dearth of galaxies in a green valley fe.g jBaldrv et alj|2004 ). 
On average, galaxies on the red sequence l ive in denser envi- 
ronments compared to blue galaxies (e.g.jHogg et al. 20031; 



Blanton et al.ll2005al;ICooper et alj200d : Martinez fc Muriell 



20061 ; ISkibba et all 120091 ) Lookback observations have fur 



ther shown that the relative fraction of red sequence galaxies 
is increasing with time, having at least doubled since z = 1 
jBell et al.ll2004l ; iBrown et alBooH ; iFaber et alj|2007h . 

Red sequence galaxies primarily owe their colors to 
a lack of hot, young stars, which are indicators of recent 
star formation. One persistent question in galaxy evolution 
is what mechanism causes a blue, star forming galaxy to 
quench - to stop forming new stars and become 'red and 
dead' - and migrat e from the blue cloud to the red sequence. 
IFaber et alj (|2007t ) favored a 'mixed scenario' for the for- 
mation of red galaxies to explain both the observed lumi- 
nosity functions and the properties of nearby red galaxies. 
In this scenario, blue galaxies increase their mass through 
star formation and mergers until quenching occurs. The end 
products of this process populate the intermediate and less 
massive end of the red sequence. Massive ellipticals on the 
bright end of the red sequence are built up through dry (gas 
poor) mergers of these less massive galaxies along the red 
sequence. 

This picture, however, still does not specify the ex- 
act physical mechanism(s) responsible for cutting off a 
galaxy's gas supply, thereby preventing the formation of 
new stars. At the risk of oversimplifying, major theories 
broadly fall into two classes. The first class says that feed- 
back from the formation of a black hole or a starburst dur- 
ing a major merger can expel the gas (e.g., ISanders et alj 
ll98Sl ; lHopkins et al.|[200^ 1. Observations indicate that AGN 
are capable of re l easing large amount s of energy (e.g., 
iFabian et all 120061 ; iRaffertv et alj 120061 ). and simulations 
suggest that the incorporation of feedback mechanisms can 
reproduce th e observed luminosi ty functions and colors of 
galaxies (e.g.. ICroton et al]|2006l ). 

The second class posits a critical dark halo mass 
(Mdit ~ W 12 A4q), above which cooling is too in- 
efficient to allow central ga l axies to accrete new 
gas (e.g., iRees fe Ostrikerl 1 19771 ; iBlumenthal et alj 1 1984 
iDekel fe Birnboirrj|2006h . In this latter scenario, a satellite 
galaxy falling into such a large halo will also be quenched. 
Some recent observations have shown evidence of the ex- 
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istence of a critica l halo mass (e.g., IBrown et alj 120081 ; 
iPasauali et"aUl2009h . 

It is surprisingly hard to distinguish the two theories ob- 
servationally. For example, both pictures predict that most 
galaxies now undergoing quenching will be found in small 
groups, and thus in modestly overdense environments. The 
mod erate velocity dispe rsions of such groups promote merg- 
ing (|Binnev et al.111982] ). as needed by the merger/feedback 
model, while the halo masses of such groups are also 
typically near 1O 12 M0, which puts them near M cr it- In- 
deed, semi-analytic models indicate that many galaxies near 
M* in the mass function both experience major mergers 
and cross the M cr j t halo boundary nearly simultaneously 
jCattaneo et al.ll200cf ). 

Galaxy morphologies, however, may be able to provide 
key independent data with which to test the two scenarios. 
Ellipticals, for e xample, are believed to b e formed through 
major mergers (|Toomre fe Toomrel [l972T ). Disc-dominated 
galaxies, on the other hand, cannot have undergone re- 
cent major mergers, as such mergers would h ave severely 
disrupted their shapes (|Toth fc Ostrikerl 119921 ). Thus, nei- 
ther class of quenching models is independently sufficient to 
explain the full range of morphological types observed for 
quenched galaxies. Mergers alone cannot explain the pres- 
ence of quenched discs, and, likewise, a critical halo mass 
alone cannot explain why quenched gala xies typically have 
ellipti cal morphologies. A recent paper by Ivan der Wei et alj 
|2009l ). for example, uses this latter point to argue that the 
roundness of galaxies with M > 10 11 Mq is evidence for the 
importance of major merging in the formation of the most 
massive galaxies. 

These two scenarios, of course, are not necessarily mu- 
tually exclusive, but if different mechanisms are responsible 
for the formation of bulges and discs (Es and SOs), then 
it is important to study them separately. In particular, the 
frequencies of each type may shed light on the relative im- 
portance of each mechanism. Thus, the bulge/disc frequency 
along the red sequence and versus other galaxy properties, 
such as galaxy size, will place an important new constraint 
on the theories of quenching. 

Currently, only a rough knowledge of bulge/disc fre- 
quencies exists. Many groups have examined how the lumi- 
nosity functio ns of different morphological types evolve with 
redshift (e.g.. iBrinchmann fc Ellis! l200fj|; iBuiidv et alj|2005l; 



Franceschini et alj 120061 ; lllbert et alj |2006l ; iPannella et alj 
20061 , 120091 ). but these studies lump Es and SOs together into 



one group. Other studies of nearby galaxies treat Es and SOs 
separately but have total sample sizes (includ ing both early 
and late types) of only a few thousand (e.g.. iMarzke et all 
ll994l;lMarinoni et al.|[l999l ). To quantify bulge/disc frequen- 
cies and study them in detail, we need a large sample of red 
sequence galaxies, such as is available from the SDSS, plus 
a method to classify them into bulges and discs. 

One crucial requirement of such a method is the ability 
to identify star-forming galaxies that are on the red sequence 
because they ar e reddened by dust . For the SDSS Main 
Galaxy Sample jStrauss et alj [20021 ) . this can be accom- 
plished using spectroscopic information. In addition, these 
galaxies typically have features that are associated with star 
formation, such as spiral arms, clumps, or dust lanes, that 
distinguish them from galaxies that are truly quenched. 

Red sequence galaxies, then, can be thought of as falling 
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into three main groups: bulges, smooth discs, and unsmooth 
discs. This last group consists of discs with features generally 
associated with star formation (such as spiral arms, clumps, 
and dust) that cause the light profile of the galaxy to deviate 
from a smooth model. Neglecting bars and rings, which may 
be found in any disc-dominated galaxy, these three groups 
correspond roughly to the Hubble types E, SO, and Sa+later, 
respectively. These divisions accomplish our goal of distin- 
guishing between bulges and discs, as well as accounting for 
the contamination of the red sequence by galaxies that may 
not be truly quenched (unsmooth discs). 

Traditionall y, morphological classification has been 
done by eye (e.g. Jde Vaucouleurs et al]|l99ll ; lFukugita et al.l 
I2007I ). Visual classification, however, is time-consuming 
and the resulting galaxy samples are small - two of the 
largest such samples consist of 2253 galaxies from SDSS 
jFukugita et all 120071 ) and 3314 galaxies from the Millen- 
nium Galaxy Catalog (|Driver et all 120061 ). One novel ap- 
proach taken by the Galaxy Zoo team involves employing 
~ 10 5 'citizen sc ientists' to visually classify 40 million galax- 
ies m the SDSS jLintott et alJl200St ). At this time, however, 
their classifications make no distinction between Es and SOs. 
To take advantage of the full sample of SDSS, we will need 
an automated method of morphological classification that 
can be done quickly on a large sample of galaxies and is yet 
capable of making fine distinctions among early type galax- 
ies. 

S everal automat ed methods, including the CAS sys- 
tem (IConselicel |2003l) and the Gini a nd M20 parameters 
(| Abraham et al.ll2003l ; FCotz et all 120041 ) . have already been 
used to classify SDSS galaxies and are able to reliably dis- 
tinguish between early and late type galaxies. But under 
these methods, the morphologies we are interested in distin- 
guishing (bulges, smooth discs, unsmooth discs) are mostly 
grouped together as 'early types.' Our approach is to capi- 
talize on two existing sources of automated parameters, the 
SDSS photometric pipeline and the galaxy modeli ng pack- 
age Galaxy IMage 2 D (GIM2D; ISimard et alj|2002f i Galaxy 
IMage 2D (GIM2D; ISimard et all |2002T I, to find combina- 
tions of automated parameters that can successfully sort red 
sequence galaxies into bulges, smooth discs, and unsmooth 
discs. 

For SDSS galaxies, one widely used automated param- 
eter is the concentration C = R90/R50, where Rqo and R50 
are the radii containing 90 and 50 per cent of the Petrosian 
flux in the r-band, respectively^ Classical bulges, which 
have bright central regions and extended outer envelopes, 
have high C because the bulk of t heir light is located at small 
radii. [Shimasaku et al.l (|200ll ) and lStrateva et all (120011 ) find 
that dividing galaxies using C gives automated samples of 
early and late types with about 15-20 per cent contamina- 
tion from the opposite class. This is promising but still does 
not accomplish our goal of distinguishing Es and SOs. 

The galaxy modeling package GIM2D has also been 
used to classify galaxies by measuring several quantitative 
parameters. GIM2D is an IRAF package which models a 



3 The Petrosian flux is defined as the flux contained within twice 
the Petrosian radius, the circular radius at which the local surface 
brightness /i(r) is e qual to 20 per cent of the enclosed mean surface 



galaxy image as the sum of two light profiles: e.g., a de Vau- 
couleurs (bulge) component and an exponential (disc) com- 
ponent. GIM2 D fits are available f or over a million galaxies 
in SDSS DR7 (|Simard et al.|[2010l ). making it a useful tool 
for a large statistical study. G1M2D has been used to study 
the properties of bulge and disc components s eparately in 



galaxies from the Millennium Galaxy Catalog u e ([Alien et al 



l2006t 



Driver et ai]|2007l ; ICameron et al.ll2009l ) . ISimard et al 



brightness /x(< r) jBlanton et al.ll200ll ). 



(|2002T ) have shown that GlM2D's bulge fraction B/T and 
smoothness S can be used to identify early type galaxies. 
Ilm et af] (|2002l ) used these criteria to obt ain a sample of field 
E/SOs in the DE EP Groth St rip Survey dWeiner et al.ll2005l : 
IVogt et al.ll2005l ). In addition. [Mcintosh et all (|2004l ) used S 
to study the presence of substructure in cluster disc galaxies. 
These two parameters, along with the SDSS-measured con- 
centration C, are a natural starting point for our automated 
method. We will also find that the SDSS-measured axis ratio 
b/a is useful in distinguishing the different morphologies. 

As described below, we initially use emission line 
strengths and line ratios from SDSS spectra to weed out 
most of the objects that are obviously star-forming. After 
requiring that the targets be included in a number of exter- 
nal catalogs (to facilitate future studies), we isolate a sample 
of roughly one thousand red sequence galaxies. These are 
classified by eye into three main groups based on the light 
profile of the galaxy at large radii: bulges, smooth discs, and 
unsmooth discs, the last group showing a variety of features. 
In addition to evaluating whether a galaxy is dominated by 
a bulge or disc component, we also judge whether it has any 
additional features, such as spiral arms or dust lanes, which 
would signal a cold disc. 

The main goal of this paper is to compare these vi- 
sual classifications with machine-derived structural param- 
eters from SDSS and GIM2D. This allows us to develop a 
method for reproducing the visual classifications using au- 
tomated parameters. With the benefit of our detailed visual 
inspections, we are able to take certain sets of machine pa- 
rameters in combination to isolate samples of bulge- and 
disc-dominated galaxies that agree with our visually classi- 
fied samples. The resulting recipes can readily be applied to 
a much larger sample of early type galaxies from SDSS in 
order to classify them by their morphologies with high accu- 
racy. Future use of this method may be possible with other 
samples with similar physical resolution scales, though some 
testing on these new samples will be required. 

Finally, the sample of a thousand objects is used to 
derive some preliminary conclusions on the frequencies of 
bulges and discs as a function of magnitude, stellar mass, 
velocity dispersion, and radius. The results are verified using 
both the visual and automated classification schemes. 

The paper is organized as follows: H2.ll describes the 
sample of red sequence galaxies used in this study. These 
galaxies are grouped into bulges, smooth discs, and un- 
smooth discs using our visual classification scheme ( i)2.2p . 
We then compare automated parameters - SDSS photomet- 
ric values (U23J| and GIM2D parameters ( $2A$ - with the 
visual classifications to develop an automated recipe for sort- 
ing galaxies into bulges, smooth discs, and unsmooth discs. 
i|2. 51 summarizes some preliminary conclusions based on the 
visual classifications. Galaxies are next sorted into auto- 
mated bulge and disc samples f £|3.1[) and the latter group 
is divided into smooth and unsmooth discs f i|3.2[) . In <}4]we 
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present the bulge/disc frequencies as a function of absolute 
magnitude, stellar mass, velocity dispersion, and radius. We 
summarize the work in fj5] Some additional notes on the 
automated parameters are discussed in [JA] The adopted 
cosmology is Qa = 0.7, fl m = 0.3, and Ho — 70 km s _1 
Mpc -1 . All absolute magnitudes and colors have been k- 
corrected to z — 0.0 using Michael Blanton's k-correct code 
(|Blanton et al.ll2003l . v3_2). 



2 DATA 

2.1 Sample selection 

The sample is extracted from the spectroscopic cata- 
log of galaxies in Data Relea se 4 (DR4) of the SDSS 
l|Adelman-McCarthv et al]|2006l ) . Because we are interested 
in studying quenched galaxies, we apply emission line cri- 
teria, based on e quivalent width (EW) measurements from 
lYan et all (|2006t ). to select galaxies whose spectra do not 
reveal obvious ongoing star formation. Such galaxies are 
defined as either lacking detectable emission (quiescent) or 
having LINER-like emissio n line ratios according to the cri- 
teria of lGraves et al.l I2007T ). Quantitatively, galaxies lacking 
emission satisfy the conditions EW[Oll] < 3 A, EW(Hq) 
< 0.7 A, while LINER-like galaxies satisfy the conditions 
EW[OH] > 3 A, EW[0II] > 5x EW(Ha)-7 . The latter 
criterion follows from the results of lYan et all (|2006T ). who 
showed that galaxies with high [Oil] /Ha ratios (LINERS) 
are not actively forming stars. Galaxies with EW[Oll] > 

3 Aand low [Oil] /Ha ratios contain star formation and/or 
Seyfert activity, and we do not include them in our sample. 

We choose only galaxies with spectra having S/N > 
20 A 1 in order to have a sample with reliably measured 
equivalent widths. Of the remaining galaxies, ~ 90 per 
cent have specific star formation rates less than 0.1 Mq 
yr~ 1 /lO 11 M0 and are predominantly located on the red 
sequence or in the green valley, though there is a tail of 
bluer objects0 The bluest objects in this sample are re- 
moved by requiring that the sample satisfy the relation 
g-r > -0.025r + 0.10 There are known issues wit h SDSS 
photometry for large galaxies (e.g. jLauer et al1l2007l ). which 
causes some extended blue discs to have red colors in the 
database. Because the color cut that we impose is gener- 
ous, problems with the g — r photometry will not affect the 
sample greatly. 

We acknowledge that our sample selection is not perfect, 
and there will be some star forming galaxies that make it 
through our selection criteria. These interlopers are present 
mainly because SDSS spectra are obtained using 3" fiber 
apertures. Some of the galaxies in our sample may be form- 
ing stars in their outer regions but satisfy our emission line 



4 Star formation rates are derived from Ho emission line lumi- 
nosities following Equation 2 of Kcnnicutt 1998. 

5 During the initial sample selection, the SDSS magnitudes used 
were not corrected for Galactic extinction. Consequently, there 
are six galaxies that do not satisfy this color cut once the corrected 
magnitudes are used. The corrected magnitudes are used for all 
analysis in this paper, and this issue only comes up in the sample 
selection. 



criteria because their non-starforming bulges dominate the 
spectral fiber aperture. 

Even with improved photometry from GIM2D, the g — r 
color is not sensitive enough to identify these star-forming 
galaxies. A much better way to find star-forming contam- 
inants is to pick out the bluest galaxies in NUV — r, us- 
ing GALEX photometry. Though we could eliminate the 
contaminants from the sample presented here, we elect to 
keep these objects because adding a selection criterion us- 
ing GALEX photometry would significantly limit the size 
of future samples of SDSS galaxies that could be examined 
with our automated classification scheme. Instead, we will 
show that most of these galaxies are unsmooth discs in our 
schema and can be identified and removed using the auto- 
mated scheme. Throughout the analysis (see t|3.2l Q we 
will return to this issue to confirm that this choice does not 
affect the results. 

We would also like these data to be useful for future 
work exploring how morphology correlates with UV colors 
and environment. For the former condition, we limit the 
sample to galaxies with a near-UV detection in Data Re- 
lease 2 (DR2) of the Galaxy Evolution Explorer (GALEX 
For the latter condition, we limit the sample to galaxies 
with measured overdensitie s in the NYU Value-Added Cat- 
alog (|Blanton et al.ll2005bT ). All of these galaxies also have 
another measure of environment, the 1 + S3 parameter of 
ICooper et all (|2008l L 

In numbers, the sample selection is as follows: 

• lYan et all ||2006h emission line measurements (SDSS 
DR4): ~ 400, 000 galaxies 

• Falls in GALEX DR2 footprint: 29,755 

• NYU Value-Added Catalog Environments (SDSS DR2): 
8,865 

• Median signal-to-noise ratio > 20 A : 3,331 

• Quiescent (915) or LINER (483): 1,398 

• g-r color cut (894 + 447): 1,341 

• GALEX NUV detection (870 + 425): 1,295 

Neither the g — r color cut nor the NUV detection cut 
removes more than 5 per cent of the sample, so we do not 
believe that they have introduced any important selection 
effects. The surviving 1,295 galaxies fall in the redshift range 
0.024 < z < 0.082. The lower limit is set by the Oil detec- 
tion requirement: the line is located at 3727 A while the 
SDSS spectra begin at 3800 A. The upper limit is set by 
the NYU Value-Added Catalog, which only has environment 
measured for galaxies with z < 0.082. 

2.2 Visual classification scheme 

The visual classifications are based on images obtained from 
the SDSS Image List ToolQ which generat es multi-band 
color JP EG thumbnails of each galaxy. (See iLupton et al.l 
|2004| and iNieto-Santisteban et al.l |2004| for details on how 
these images are made.) The images were examined on a 
computer monitor (as opposed to printed pages) in order to 
see the full dynamic range. 

The first visual classification parameter sorts galaxies 

6 http://galcx.stsci.edu/GR2/ 

7 http:/ /cas. sdss.org/astro/en/tools/chart/list. asp 
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according to their radial light profiles; this is the bulge class, 
BC. Bulge-dominated galaxies (BC = 1) have bright cen- 
tres with a very gradual fall-off in brightness at all radii. 
The outer regions are characterized by an extended enve- 
lope without a clear edge. Examples are shown in Figure Q] 
Disc-dominated galaxies (BC = 3), on the other hand, have 
a sharp outer edge where the light drops off dramatically, as 
well as a flatter light profile at intermediate radii just inside 
the outer boundary. Examples are shown in Figure [5] The 
stretch used for the thumbnails in the SDSS Image Tool is 
very tight near the sky level and shows these differences to 
good advantage. 

We explicitly acknowledge that our BC = 1 galaxies are 
not pure bulges, and our BC = 3 galaxies are not pure discs. 
For that reason 'bulge-dominated' and 'disc-dominated,' re- 
spectively, would be more appropriate terms. However, for 
brevity, we will use the terms 'bulges' and 'discs' in the rest 
of the paper. 

Objects that do not fall clearly into either the bulge 
or disc categories are classified as intermediate (BC = 2). 
Examples are shown in Figure [3] Many have characteris- 
tics indicating the presence of both bulge-like and disc-like 
components. These include galaxies with a gradual fall-off 
of light in some azimuthal sectors but also a sharp bound- 
ary in others (these are often edge-on or disturbed). Others 
are objects that appear to have a disc embedded within a 
more extended classical bulge. Many such discs tend to be 
edge-on, which suggests that we tend to identify such ob- 
jects preferentially when they are highly inclined. (See [J3] 
for more discussion.) We stress, however, that the identifi- 
cation of a disc depends purely on finding either sharp-edged 
or flattened (edge-on) features in the light distribution, and 
thus the presence of a dust lane or other features related to 
the classification parameter FEAT (described below) has 
nothing to do with the value of BC. 

The values for BC were first determined by J. C. and 
then agreed upon by consensus among J. C, G. G., and 
S. F. Between the first independent classification and the 
group consensus there was remarkably little disagreement 
(~ 3 per cent of the total sample had their BC values 
changed). Objects for which there was disagreement were al- 
ways on the borderline between the intermediate (BC = 2) 
category and the extremes (BC = 1 or 3). In no case was 
there disagreement between the two extremes. The number 
of objects moved in each direction when adopting final clas- 
sifications was about equal. That is, the number of objects 
moved from bulge to intermediate (BC = 1 to 2) was about 
equal to the number of objects moved from intermediate to 
bulge (BC = 2 to 1). 

When developing our visual scheme, a brief attempt 
was made to further distinguish between more bulge-like and 
more disc-like intermediates (which we coined '2-' and '2+', 
respectively), so as to make a 5-point scale in BC instead 
of a 3-point scale. The resulting classifications yielded small 
samples of each new type, and the great bulk of intermedi- 
ates remained in the middle bin (BC = 2) . Because the new 
classifications affected such a small number of the galaxies, 
the 3-point scale was kept for simplicity. This will be revis- 
ited for the analysis in ^3] 

The second visual classification parameter sorts galax- 
ies according to the presence of non-smooth 'features': spiral 
arms, bars, clumps, rings, or dust. Most galaxies have no vis- 



ible features (FEAT — 1). Other galaxies have features that 
we identify as weak (FEAT = 2) or strong (FEAT = 3). 
The latter are galaxies in which arms, bars, rings, and dust 
lanes can be traced over a considerable extent and/or indi- 
vidual clumps are highly visible. Examples of the different 
features and their varying strengths are shown in Figure 

For each galaxy, the strength of each type of feature is 
noted (absent, weak, or strong), resulting in five different 
numerical values between 1 and 3. The FEAT index is de- 
termined by taking the maximum of these five values. For 
example, a galaxy with strong spiral arms and a weak bar 
is classified as FEAT = 3. A galaxy with weak spiral arms 
and weak dust lanes is classified as FEAT = 2. A galaxy 
has FEAT = 1 only if it exhibits none of the five features. 

The FEAT index encompasses two types of features. 
One type is classically identified with star formation - spiral 
arms, clumps, and dust absorption. The second type - bars 
and rings - may exist in otherwise smooth discs without 
obvious star formation. A noteworthy finding in this work 
is that, with the rare exception of dust, none of the above 
feature elements are found to any significant degree in bulge- 
dominated (i.e., BC — 1) galaxies (cf. Figures [T] and 0. 
In other words, as expected, the presence of any of these 
features, of whatever kind, is an excellent predictor of a disc- 
dominated (i.e., BC — 3) galaxy, and the close correlation 
is independent confirmation of our purely light-based bulge- 
disc classifications (BC). 

Since our focus in this paper is on the relative numbers 
of bulges and discs, we have simply lumped together all types 
of features to define the FEAT index, though the exact 
nature(s) of the features detected (whether bar, spiral, etc.) 
is retained in the database. These features are also difficult 
to distinguish using automated parameters, so we elect to 
use only one visual index. While not all of the features are 
associated with star formation, most galaxies with FEAT ^ 
1 (78 per cent) have at least one starforming feature. Thus, 
we will use FEAT as an indicator of galaxies that are not 
truly quenched. 

We also separately identify any galaxies that are dis- 
turbed or interacting (some are shown in Figure [3} with 
indices DIST and INT. These make up a small portion of 
the sample (see £| 2 . 5 P and are not used in the development 
of the automated method (32}. As with the other visual pa- 
rameters, the values are determined by eyeball classification 
on a discreet scale from 1 to 3, with DIST(INT) = 1 being 
undisturbed (not interacting) and DIST(INT) — 3 being 
strongly disturbed (obviously interacting). Whereas FEAT 
measures the strengths of very specific types of morpholog- 
ical features, DIST and INT are completely separate from 
FEAT and measure the peculiarities in a galaxy's appear- 
ance or immediate environment. 

A major limitation of the visual scheme is the difficulty 
of classifying small galaxies. These galaxies appear generally 
compact, and little or no structure is visible. Because of their 
small size, the shape of the outer light profile is difficult to 
estimate visually, and no substructures can be resolved. If 
the radius encompassing 90 per cent of the Petrosian flux 
(Rgo) is used as a proxy for size, these galaxies have small 
J?90 compared to the rest of the sample, typically with values 
log Rgo 1.15 (-R90 ~ 14"). This criterion is able to identify 
75 per cent of the galaxies that were visually determined to 
be too small to classify reliably. We make the size cut using 
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BC = 1 (346 objects) 
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Figure 1. Examples of visual bulges (BC = 1). These are objects that are highly centrally concentrated and have no sharp outer edge. 
Each galaxy is labelled with its SDSS plate, modified Julian date, and fiber ID. The postage stamps are approximately 50 x 50 arcsecs. 
For visual classification, the images were examined on a computer screen in order to see the full dynamic range. 



an automated parameter to allow for future application to 
larger samples. After removing galaxies with log Rgo ^ 1.15, 
we are left with 998 galaxies in the sample. 



2.3 SDSS photometry 

Of these galaxies, 997 had full SDSS photometric fits, which 
are available as part of SDSS DR4. For each galaxy, we ob- 
tained the model magnitudes in the SDSS g- and r-bands, 
as well as the axis ratio in the r-band (b/a). The model 
magnitudes are calculated using either a de Vaucouleurs or 
exponential profile, with the choice of profile being deter- 
mined by likelihoods calculated from \ 2 fits- The axis ratio 
is always determined from the de Vaucouleurs fit. More de- 
tail on how these quantities are measured can be found in 
the d escription of the SDSS Data Release 2 (|Abazaiian et all 
2004). We also utilize the radii containing 50 per cent and 
90 per cent of the P etrosian flux in the r-band (R50 and R90; 
IBlanton et al ]|200ll) . The ratio R90/R50 is the concentration 
C. 



2.4 GIM2D parameters 

The galaxy images were run through GIM2D, which fits the 
light of a galaxy as the sum of a de Vau couleurs (bulge) pro - 
file and an exponential (disc) profile (|Simard et alj [2002? ) . 
The best-fit bulge+disc decomposition is found through 
a ^-minimization of 12 free parameters, as described by 
ISimard et all (|2002h . We use two of these parameters in our 
analysis: the bulge light fraction (B/T) and the disc incli- 
nation (i). The bulge fraction is given by the flux from the 
bulge component divided by the total flux from both the 
bulge and disc components. A pure disc has B/T = 0. The 
disc inclination is the angle between the vertical axis of the 
fitted disc and the line of sight, in degrees. A face-on disc 
has i = 0. 

In addition, GIM2D also computes image indices from 
the residual image (galaxy image minus model fit). Two 
of these indices, the total residual (Rt) and the asymmet- 
ric residual (Ra), measure the galaxy's deviation from the 
smooth model ligh t profile. Quantitativ ely, these are defined 
in Equation 11 bv lSimard et all (|2002l ): 



Rt = 



X(l/2)\Rjj + Rj?°\ £(i/2)|£, 



Bjf°\ 



Eh. 



Eh 



and 



Ra 



E(l/2)\Rij - R\f\ 
Eh, 



E(l/2)\B t] -Bj 
EI„ 



(1) 



(2) 



where Rij is the pixel value of an object pixel in the resid- 
ual image, hj is the corresponding pixel value in the science 
image, and Bij is the pixel value of a random background 
pixel in the science image. Rjj° and B, 1 / are the corre- 
sponding galaxy and background pixel values in the resid- 
ual image after a 180 degree rotation about the centre of 
the galaxy. The position of the centre is specified by two of 
the 12 free parameters that are fit in GIM2D. We were able 
to obtain GIM2D parameters for 986 galaxies. The GIM2D 
fits used here were derived by simultaneous fitting of g- and 
r-band images, SDSS deblending, and sky levels determine d 
by GIM2D; the details are described in lSimard et all l|2010h . 

In our analysis, we will be using the sum of the residuals 
Rt and Ra, called the smoothness parameter s2. The '2' in- 
dicates that Rt and Ra are calculated within two half-light 
radii of the galaxy. Actually, a better name for s2 would be 
the 'unsmoothness' parameter, as high s2 indicates a clumpy 
light distribution (large Rt), the presence of asymmetric 
features (large Ra), or both. 

We remove two galaxies with suspicious values of s2. 
Galaxies with a bright star or galaxy companion near their 
centres fall in the tails of the s2 distribution. Some of the 
galaxies with the lowest values of s2 have large compan- 
ion galaxies superimposed. Two objects have s2 < 0.0, and 
both are overwhelmed by the light of the nearby, larger 
galaxy. This makes the background subtraction difficult, 
which causes the value of s2 to be negative. These fits ap- 
pear to be unreliable, so we omit all targets with s2 < 0.0 
from the sample. 

Some of the galaxies with the highest values of s2 have 
bright stars visible in the field, and their high s2 values 
may be an artefact of the superimposed extra object. These 
are difficult to distinguish from objects with genuinely un- 
smooth features with a simple cut in s2 (unlike the case of 
objects with negative s2), so we leave them in the sample. 

The final sample consists of 984 galaxies that are large 
enough to classify visually and have valid automated pa- 
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BC = 3, FEAT = 1 (215 objects) 
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BC = 3, FEAT = 2 (135 objects) 
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BC = 3, FEAT = 3 (51 objects) 
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Figure 2. Examples of visual discs (BC = 3). These are objects that have a sharp outer edge where the light drops off rapidly, with a 
flatter light profile at intermediate radii just inside the outer boundary. Row f : Smooth, featureless discs (FEAT = 1). Row 2: Unsmooth 
discs with weak features (FEAT = 2). Row 3: Unsmooth discs with strong features (FEAT = 3). Columns show the kinds of features 
noted in our visual classification scheme, from left to right: Spiral Arms, Bars, Clumps, Rings, and Dust. Some features, such as clumps 
and dust, show better on a computer monitor and are not reproduced well on paper. 



rameters in the redshift range 0.02 < z < 0.08. Because the 
galaxies cover a relatively small redshift range, errors from 
the K-correction should be small. Table [1] presents visual 
classifications, SDSS identifiers and photometry, GALEX 
photometry, and GIM2D parameters for the sample. A full 
electronic version of the table is available. 



2.5 Preliminary conclusions based on visual 
classifications 

In this section we present some findings based on the visual 
classifications of the final sample of 984 galaxies. First, a 
large fraction of these early type galaxies are found to have 
strong discs. Our sample consists of 346 bulges (35 per cent, 



BC — 1), 237 intermediates (24 per cent, BC = 2), and 401 
discs (41 per cent, BC = 3). Thus, our analysis confirms 
what had been known before, that SOs and Sas (i.e., discs) 
comprise a very significant fraction of red sequence galaxies. 

Figure |3] shows a color-magnitude diagram of the sam- 
ple, with colors and symbols denoting bulge class BC. 
The panels at the top and right show the magnitude and 
color distributions, respectively, of bulges, intermediates, 
and discs. While the colors of all three classes peak at about 
the same value, the bulges have the narrowest distribution, 
and most of the outliers are intermediates or discs. 

As expected, most of the brightest objects are giant 
ellipticals, and most of the faintest objects are discs. There 
are, however, also many bright discs. At magnitudes brighter 
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BC = 2, FEAT = 1 (199 objects) 
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BC = 2, FEAT / 1 (38 objects) 
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BC = 2, DIST £ 1 (45 objects) 
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Figure 3. Examples of visual intermediates (BC = 2). These galaxies are judged to be intermediate between the visual bulge and disc 
categories. Row 1: Smooth, featureless galaxies (FEAT = 1) with light profiles that are not obviously bulgy or discy. Row 2: Same as 
Row 1, but with features (FEAT ^ 1). All types of features found in our visual classification scheme are seen in intermediate galaxies; 
most of the features are weak. Row 3: Examples of problematic objects that have been classified as intermediate due to their disturbed 
morphologies, which make the edge profiles hard to assess. 



Table 1. Visual Classifications and Automated Parameters for 984 Galaxies. 
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(2) 
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(14) 
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438 


51884 


284 
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120.059 


46.043 


-19.258 


-20.967 


-21.851 


-22.086 


-22.455 


-17.694 


-16.387 


10.798 


22 


771 


52370 


341 


587725475491610665 


150.930 


61.739 


-18.730 


-20.646 


-21.415 


-21.814 


-22.110 


-15.574 


-234.919 
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17 



Full version available online. Columns: (1-11) SDSS Plate, MJD, Fiber ID, ObjID, RA, Dec, K-corrected ugriz ; (12-13) GALEX 
K-corrected NUV, FUV; (14-16) SDSS Petrosian R 50 , Petrosian Rg , Axis Ratio b/a; (17-19) GIM2D bulge fraction B/T, smoothness 
s2, inclination i; (20-28) Visual Classification BC, FEAT, ARMS, BAR, CLUMPS, RING, DUST, DIST, INT; (29) Automated 

Classification: 1-Bulgc, 2-Intermediate, 3-Disc 
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Figure 4. Color magnitude diagram of the final sample of 984 galaxies that are large enough to classify visually and have v alid automated 
param eters. Absolute magnitudes are SDSS model fluxes k-corrected to z = 0.0 using Michael Blanton's k-correct code jBIanton et al.l 
|2003| . v3_2). The symbols indicate the visual classification BC: bulges (BC = 1, red circles), discs (BC = 3, blue bars), and intermediates 
(BC = 2, green circles+bars). The dashed line indicates the color cut that was described in £|2,ll Using only the emission line criteria 
described in the text to remove star forming galaxies yields a sample of predominantly red and green galaxies. The top and right panels 
show histograms of each visual type in magnitude and color, respectively. The visual bulges (red) occupy a large range in magnitude but 
a narrower range in color compared to the visual discs (blue dash-dotted) and intermediates (green dashed). 



than M r = -22, there are 121 bulges (BC = 1), 41 interme- 
diates (BC — 2) , and 69 discs (BC — 3) . The objects in this 
last group are truly discs: 48 of the 69 discs brighter than 
Mr = -22 have features (FEAT / 1), which would likely 
cause them to be classified as Hubble Types Sa or Sb. We 
will be returning to these galaxies in N3.ll 

Second, 229 (23.3 per cent) galaxies show features 
(FEAT 7^ 1). The frequency of each feature type is listed in 
Table [5] which shows the number of galaxies with each BC 
with various features at different strengths. Very few bulges 
(BC = 1) have features, and those that do have dust. In 
contrast, ~ 80 per cent of face-on BC = 3 galaxies show 
features, nearly all of which are associated with gas and/or 
dynamically cold stellar populations, both of which are well- 
known disc tracers. The evidence from FEAT is therefore 
a strong independent validation that our main disc crite- 
rion based on sharp outer brightness profile fall-off is indeed 



highly correlated with the presence of a dynamically cold, 
rotating stellar population. 

Finally, only 99 galaxies (~ 10 per cent) appear to have 
disturbed morphologies, and only 13 (~ 1.3 per cent) ap- 
pear to be interacting with nearby neighbors. Nearly half of 
these were classified as intermediates (BC = 2) , as their dis- 
turbed appearances often made it difficult to identify them 
as having clear bulge or disc morphologies. However, we can 
already see that the SDSS thumbnails are not the best ma- 
terial to judge disturbances, and evidence of more subtle 
peculiarities is often difficult to see by eye. Future work on 
these objects may be done using GIM2D's residual images, 
which are a much more powerful tool for this kind of analy- 
sis. Furthermore, the limiting surface brightness of the SDSS 
is too brig ht to see the f a inter tidal features such as those ob- 
served by Ivan Dokkuml (|2005l ). whose much deeper imaging 
showed that 53 per cent of their red galaxy sample exhibited 
signatures of tidal interactions. 
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Table 2. Frequency of Features for 984 Galaxies. 
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The upper part of the table shows the numbers of objects with the five kinds of 
features that make up the FEAT classification, which collects together all types of 
features that might signal a disc. For each visual morphological type, the number 
of galaxies with each feature strength is shown. The last column (cos i > 0.8) lists 
the frequencies for face-on discs only. Note that visual bulges {BC = 1) show no 
features except dust lanes. The lower part of the table shows the number of objects 
with each value of DIST and INT. About 10 per cent of the sample is disturbed or 
interacting with another galaxy. 



3 AUTOMATED CLASSIFICATION SCHEME 3.1 Bulges vs. discs 



We have identified three populations - bulges, smooth discs, 
and unsmooth discs - in our sample of 984 galaxies using 
a visual classification system. In this section, we show that 
we are able to reproduce the visual classifications using a 
set of automated parameters derived from the models fit by 
GIM2D together with photometric measurements available 
from the data products of the SDSS. This automated classifi- 
cation method will allow us to assign morphologies to large 
numbers of SDSS galaxies in future work without having 
to visually inspect each one. We will refer to the resulting 
morphological samples as the automated bulge, automated 
intermediate, and automated disc samples. 

We initially assume that the visual classifications are 
the 'true' classifications and determine what combinations 
of automated parameters are most effective in reproducing 
the visual morphological types. We divide the automated pa- 
rameter space and assign an automated classification to each 
region based on which visual classification is most common 
in that region. With the automated classification scheme in 
place, we examine more closely the galaxies where the visual 
and automated classifications are discrepant. These exam- 
ples shed light on the strengths and weaknesses of the visual 
classifications as well as on the accuracy of the machine- 
generated measurements made by GIM2D and SDSS. 

In addition, some instances in which the automated pa- 
rameters are less reliable are discussed in iJX] Briefly, B/T is 
underestimated for luminous bulge-dominated galaxies that 
are more concentrated than a profile with Sersic index n — 4 
and C is least reliable for the smallest and/or most elon- 
gated galaxies. Because our automated scheme uses several 
parameters in conjunction, the results presented in i]3.1l and 
i|3.2l are unlikely to be strongly influenced by these findings. 



3.1.1 Method 

Figure[5]shows the correlation between the visual bulge class 
index (BC) and the GIM2D bulge fraction (B/T). As ex- 
pected, visual bulges (BC = 1, red) typically have higher 
B/T than visual discs (BC = 3, blue dash-dotted). We di- 
vide the sample into two populations, one with high bulge 
fraction (B/T > 0.5), which includes most of the visual 
bulges, and one with low bulge fraction (B/T ^ 0.5), which 
includes most of the visual discs. We consciously set the 
B/T boundary a bit low (at 0.5, rather than 0.55 or 0.6) to 
keep most of the visual bulges together in the same sample. 
Interloper discs and intermediates (BC = 2, green dashed) 
will be removed from the high B/T sample later using other 
automated parameters. 

The above cut yields good general agreement between 
the visual classification BC and the GIM2D parameter B/T, 
which indicates that both methods are able to distinguish 
between different light profiles with reasonable reliability. A 
comparison of the bulge and disc galaxies in Figures [T] and 
[2] clearly shows the contrast between the two types. The 
bulges (BC = 1, Figure [TJ) appear more extended and have 
no clear outer boundary, while the discs (BC = 3, Figure [2] 
Row 1) have a visible edge. These two groups of objects were 
identified initially by eye but have different values of B/T, 
which indicates that we are able to make the distinction 
between bulges and discs both visually and automatically. 

The number of each visual type falling on both sides of 
the B/T = 0.5 cut are listed in Tabled The completeness is 
the percentage of all visual bulges (discs) that are recovered 
by the B/T criterion, while the purity is the percentage 
of all galaxies in the high-B/T (low-B/T) sample that are 
truly visual bulges (discs). Based on these results, if given 
only B/T, we can reasonably say that galaxies with B/T ^ 
0.5 are most likely visual discs; the purity of the \ow-B/T 
sample is 79 per cent. Galaxies with B/T > 0.5, however, 
cannot all be assumed to be visual bulges; half of high- B/T 



Automated Morphological Classification of SDSS Red Sequence Galaxies 11 



Bulge (BC=I) 
Intermediate (BC=2) 
Disc (BC=3) 



Otll 
0.0 





0.2 



0.4 0.6 
GIM20 Bulge Fraction (B/T) 



0.20 



0.15 



0.10 



0.05 



Disc 



Disc 



= % 
a=°_=e» . 



oo 



Bulge 



0.0 



0.2 0.4 0.6 0.8 

GIM20 Bulge Fraction (B/T) 



1.0 



Figure 5. Histogram comparing the GIM2D bulge fraction 
(B/T) and the visual classifications BC: bulges (BC = 1, red), 
discs (BC = 3, blue dash-dotted), and intermediates (BC = 2, 
green dashed). Most galaxies with B/T ^ 0.5 are visual discs, but 
only about half of the galaxies with B/T > 0.5 are visual bulges. 
The presence of a large number of visual discs and intermediates 
in the high- B/T sample indicates that B/T alone is not able to 
reproduce all of the visual classifications. The numerical results 
are tabulated in Table [3] 



Table 3. Visual Bulge Class vs. GIM2D Bulge Fraction. 



Figure 7. GIM2D Bulge Fraction B/T vs. GIM2D Smoothness 
s2. Symbols indicate visual classifications as defined in Figure [4] 
Shaded regions indicate the adopted automated classifications. Of 
the high-B/T galaxies, most visual bulges (red circles) are smooth 
(s2 ^ 0.08), while visual discs and intermediates are more likely 
to show dumpiness or asymmetry (s2 > 0.08). We assign the 
high-s2 galaxies to the automated disc sample (blue shading). 
However, many visual discs and intermediates still remain in the 
low-s2 region containing candidates for the automated bulge sam- 
ple (pink shading) which leads us to explore the additional cuts 
in Figures 151 and l9l 



Visual GIM2D Completeness 

B/T > 0.5 B/T s: 0.5 



Bulge (BC = 1) 


330 


16 


95.4 


Int. (BC = 2) 


196 


11 




Disc (BC = 3) 


186 


215 


53.6 


Purity 


46.3 


79.0 





Comparison of visual bulge class BC and bulge fraction B/T. 
Bold values indicate instances where the methods agree. The 
completeness is the percentage of all visual bulges (discs) that 
are recovered by the B/T criterion. The purity is the percentage 
of all galaxies in the high- B/T (low-S/T) sample that are truly 
visual bulges (discs). The purity of the high B/T sample is 
relatively low (46.3 per cent), which indicates that B/T alone 
cannot reproduce the visual classifications. 

galaxies are determined visually to have moderate (BC = 2) 
or strong (BC — 3) discs. 

To understand this disagreement we examine the visual 
discs with bulge fractions well above the B/T — 0.5 bound- 
ary (BC = 3, B/T > 0.6); these make up 28 per cent of the 
visual disc sample. Examples are shown in Figure [5] Pos- 
sible explanations for the disagreement are discussed below 
( 33.1.2( 1. but the point for now is that the great majority 
of these cases are indeed really disc-dominated systems and 
B/T alone is insufficient to reproduce our visual classifica- 
tions. 

Three other parameters allow us to distinguish the 
different morphologies within the high- B/T sample: the 
GIM2D smoothness parameter s2, the SDSS axis ratio b/a, 
and the SDSS concentration index C. We apply cuts in these 



parameters successively using an algorithm that starts with 
s2, proceeds to b/a, and ends with C (the order is discussed 
at the end of this section). We begin with Figure [7] which 
shows B/T plotted against s2 for the 984 galaxies in our 
sample. As in Figure[4]the symbols represent the three types 
of visual classification (BC): bulge (red circles), disc (blue 
bars), and intermediate (green circles+bars). The shaded re- 
gions represent the adopted automated classification bound- 
aries, which will be described in more detail below. 

All of the galaxies with B/T ^ 0.5 are classed as au- 
tomated discs (shaded in blue), because this cut gives a 
relatively pure sample of discs (79 per cent, see Table 2). 
Figure [7] shows again that this region is clearly dominated 
by visual discs. We now turn our attention to the high-B/T 
galaxies. We would like to use s2 to separate the bulges and 
discs with B/T > 0.5. 

For a given parameter (e.g., s2), the boundary between 
different types is chosen by eyeballing the value at which the 
local densities of visual bulges and discs are roughly equal. 
In general, this can be done by making a cut which places the 
tail of the visual bulge distribution into the automated disc 
sample, so that most visual bulges (with the exception of 
some outliers) are included in the automated bulge sample. 
A cut at s2 = 0.08 successfully distinguishes many of the 
visual intermediates and discs from the bulges in the high- 
B/T sample, while also retaining most of the visual bulges 
in the automated bulge sample. We therefore designate the 
region B/T > 0.5, s2 ^ 0.08 as containing candidates for 
the automated bulge sample (shaded in pink). These are as 
yet just candidates; their bulge status will be re-evaluated 
based on the values of the other automated parameters. 

The next step is to determine what automated classifi- 
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BC = 3, B/T > 0.6 (111 objects) 
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Figure 6. Examples of visual discs (BC = 3) that are bulge-dominated according to GIM2D (B/T > 0.6). All of these galaxies satisfy 
the criterion of having a clear edge, as described in the text. Visual inspection of these galaxies convinces us that high-B/T galaxies are 
contaminated with a significant fraction of discs and thus that B/T alone is not a reliable way to distinguish between bulges and discs. 



cation to assign to galaxies in the region B/T > 0.5, s2 > 
0.08. Because most of the objects in this region are visual 
discs (90 of 169), we permanently assign all galaxies in this 
region to the automated disc sample (shaded in blue). The 
79 discrepant objects include 48 visual intermediates and 31 
visual bulges. By visual inspection, almost all of the latter 
have high smoothness parameters because of dust, disturbed 
shapes, nearby companions, or bright stars in the field. Thus, 
in assigning all of these objects to the automated disc sam- 
ple, we make an error of at most one class for the great 
majority of objects and misclassify at most a handful of 
problematic bulges that would be difficult to deal with in 
any case. 

Proceeding to the next step, Figure [8] shows B/T plot- 
ted against b/a for the 984 galaxies in our sample. Most 
visual bulges have high axis ratios (i.e., they are round), so 
objects in the region B/T > 0.5, b/a > 0.65 are designated 
as candidates for the automated bulge sample (shaded in 
pink). Based on the numbers of each visual type present, the 
regions B/T > 0.5, 0.45 < b/a < 0.65 and B/T > 0.5, b/a < 
0.45 are permanently assigned to the automated intermedi- 
ate (shaded in green) and automated disc (shaded in blue) 
samples, respectively. More elongated galaxies are classed 
as visual intermediates or discs, consistent with traditional 
Hubble Types, in which elliptical galaxies cannot intrinsi- 
cally have b/a < 0.33 (the most elongated ellipticals are E7). 
The possibility of a bias in our visual classifications due to 
inclination effects is discussed below ( i|3.1.2|) . For now, we 
assume that our visual classifications are representative of 
the 'true' morphologies. 

Finally, Figure [9] shows B/T plotted against C for the 
984 galaxies in our sample. The distributio n of visual bulges 
(red c ircles) is consistent with the r esults of lShimasaku et al] 
j200ll ) and IStrateva et~ai1 (|200ll ), who found that C is 
closely correlated with a galaxy's morphology. Most visual 
bulges are highly concentrated, so objects in the region 
B/T > 0.5, C > 2.9 are designated as candidate automated 
bulges (shaded in pink). Of the 87 objects in the region 
B/T > 0.5, C ^ 2.9, 39 have already been classified as auto- 
mated discs based on their values of s2 or b/a, and another 
16 have been classified as automated intermediates based on 
their values of b/a. Of the remaining 32, 9 are visual bulges, 
13 are visual intermediates, and 10 are visual discs. Because 
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Figure 8. GIM2D Bulge Fraction B/T vs. SDSS Axis Ratio 
b/a. Symbols indicate visual classifications as defined in Fig- 
ure [4] Shaded regions indicate the adopted automated classifi- 
cations. Of the high-S/T galaxies, most visual bulges are round 
(b/a > 0.65) while visual discs and intermediates are more elon- 
gated (b/a ^ 0.65). We assign the lowest-fe/a (b/a ^ 0.45) 
galaxies to the automated disc sample (blue shading) and the 
intermediate-fe/a (0.45 ^ b/a < 0.65) galaxies to the automated 
intermediate sample (green shading). The possibility of a bias in 
the visual classifications due to inclination effects is discussed in 
the text ( jpjXg) , 



most of the unclassified objects in this region are visual in- 
termediates, we assign them permanently to the automated 
intermediate sample (shaded in green). Five of these galaxies 
were noted during visual classification as being small galax- 
ies that were not eliminated from the sample using the log 
7?90 ^1.15 size cut described in i]2.2l Additional discussion 
of the high-C, small, and elongated galaxies in this figure is 
given in the appendix. 

To summarize, in Figures [7] [8] and [9] our visual bulges 
are generally round (high b/a), smooth (low s2), and cen- 
trally concentrated (high C). We therefore can say that a 
galaxy which occupies all three of the pink shaded regions 




Automated Morphological Classification of SDSS Red Sequence Galaxies 13 




0.2 0.4 0.6 0.8 

GIM20 Bulge Fraction (B/T) 



B/T 


>0.5 ( 


s? 


S0.08 , 


b/o 


>0.65 


c 











Disc 



s2 



>0.08 



S2.9 



Unsmooth 




Smooth 




Int. 




Bulge 


















Figure 9. GIM2D Bulge Fraction B/T vs. SDSS Concentration 
C. Symbols indicate visual classifications as defined in Figure [4] 
Shaded regions indicate the adopted automated classifications. 
Of the high-B/T galaxies, most visual bulges arc highly concen- 
trated (C > 2.9). Most of the heretofore unclassified galaxies in 
the region B/T > 0.5, C ^ 2.9 are visual intermediates, so we 
assign them to the automated intermediate sample (green shad- 
ing). The appendix has further discussions of the high-C, small, 
and elongated galaxies in this figure. 



in Figures [7] [8] and [9] is likely to be a visual bulge and should 
therefore be put in the automated bulge sample. 

Numerically, the automated bulges satisfy the following 
criteria: B/T > 0.5, s2 < 0.08, b/a > 0.65, and C > 2.9. 
Similarly, objects are classified as automated discs if they 
satisfy one of the following criteria: B/T ^ 0.5 or s2 > 0.08 
or b/a ^ 0.45. Objects are classified as automated interme- 
diates if they satisfy B/T > 0.5, s2 < 0.08, b/a > 0.45 and 
(C < 2.9 or b/a < 0.65). In words: 

(i) To be an automated bulge, a galaxy must have a high 
bulge fraction and be smooth and roundish. 

(ii) To be an automated disc, a galaxy either has to have 
low bulge fraction or be lumpy or be elongated. 

(iii) Automated intermediates are all other cases. 

This logic is depicted as a flowchart in Figure 1101 These are 
the automated classifications used in the remainder of the 
paper. 

It should be noted that the classifications of automated 
discs and intermediates depend somewhat on the precise or- 
der in which the cuts are applied. The above order was cho- 
sen so as to maximize the purity of the automated samples. 
For example, there are 34 galaxies for which the order of the 
s2 and C cuts would change their automated classifications. 
This group (B/T > 0.5, s2 > 0.08, C 2.9), which consists 
of 1 visual bulge, 7 visual intermediates, and 27 visual discs, 
is placed in the automated disc sample if the s2 cut is ap- 
plied first and in the automated intermediate sample if the 
C cut is applied first. Since the majority of these galaxies 
are visual discs, the preferred order is s2 first. 

Similarly, there are 4 visual bulges, 26 visual intermedi- 
ates, and 24 visual discs that satisfy the criteria s2 > 0.08, 
0.45 < b/a < 0.65; this region is designated to be part of 
the automated disc sample by making the s2 cut first. And 



Figure 10. A schematic diagram of the automated method. All 
984 galaxies in our sample can be classified as automated bulges, 
intermediates, smooth discs, or unsmooth discs. The numerical 
results are tabulated in Tables [4] and [5] 



Table 4. Visual vs. Final Automated Bulge and Disc Classifica- 
tions. 



Visual 




Automated 




Completeness 






Bulge 


Int. 


Disc 




Bulge (BC 


= 1) 


260 


39 


47 


75.1 


Int. (BC = 


2) 


68 


70 


99 


29.5 


Disc (BC = 


= 3) 


28 


41 


332 


82.8 


Purity 




73.0 


46.7 


69.5 





Same as Table [3] but using the final, refined automated 
classification recipe described in the text. 

lastly, there are one visual intermediate and 7 visual discs 
that satisfy the criteria b/a ^ 0.45, C ^ 2.9; this region is 
designated to be part of the automated disc sample by mak- 
ing the b/a cut before the C cut. This gives the best possible 
order as s2, b/a, then C. 

The final comparison between visual and automated 
types is shown in Table |4j which repeats Table [3] using the 
improved automated bulge, disc, and intermediate classifica- 
tions rather than using only B/T. Comparison with Table [3] 
shows that the purity of the automated bulge sample is sig- 
nificantly improved over that of the high- B/T sample, from 
46.3 per cent to 73.0 per cent. The purity of the automated 
disc sample is ~ 10 per cent lower than that of the \ow-B/T 
sample, but its completeness is ~ 29 per cent higher. With 
the additional parameters s2, C, b/a, the purities of both 
the automated bulge and disc samples are ~ 70 per cent; 
the completenesses are 75 per cent and 83 per cent, respec- 
tively. Of the 69 brightest visual discs (BC = 3, M r < -22) 
discussed in i]2.5l 60 (87 per cent) are correctly classified 
as automated discs. Indeed, 42 of them satisfy the initial 
condition of B/T < 0.5. 

In summary, the agreement between the visual and au- 
tomated methods appears to be quite good. Furthermore, 
the disagreement between methods is rarely more than one 
type. That is, the contamination across types (i.e., BC = 3 
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galaxies in the automated bulge sample or BC = 1 galax- 
ies in the automated disc sample) is < 10 per cent in each 
automated sample. Thus, we are confident that the auto- 
mated classifications can be extrapolated to determine the 
morphologies of a large sample that is not inspected visually, 
provided that sample has the same basic selection parame- 
ters as this well-tested sample. 

3.1.2 Disagreements between visual and automated 
classifications 

We now discuss cases where the visual and automated meth- 
ods disagree in order to understand the sources of the dis- 
crepancies. First, we consider the visual discs that are classi- 
fied as automated bulges. Many have low surface brightness 
discs, which points to a subtle difference between the two 
classification schemes. For the visual scheme, the criterion 
is that a disc be present and dominate the light in the outer 
regions. For the automated scheme, the criterion is that the 
disc contain more of the galaxy's total light than the bulge. 
In other words, the light of the galaxy may be truly bulge- 
dominated in terms of the total light, but the presence of a 
faint disc causes it to be classified as a visual disc. 

Another possible source of discrepancy is that GIM2D 
fits are performed on r-band images of the galaxies, while the 
visual classification is done using multi-band color images. 
In the r-band the bulge component may contribute more 
light, boosting B/T. Whatever the case may be, we conclude 
that the automated bulges are definitely failing to find some 
visual discs, but the actual amount of disc light in these 
interesting objects may be low and needs to be quantified 
via more careful modeling in the future. 

In the reverse case - visual bulges that are classified as 
automated discs - some have non-smooth features (such as 
dust, a double nucleus or a superposed star), which cause the 
galaxy to have s2 > 0.08. Others are visual bulges that have 
B/T ^ 0.5 (i.e., automated discs). Their images support the 
visual classification, and one possibility is that these galaxies 
may simply be the error tail of the B/T distribution, as most 
of them fall near the B/T = 0.5 boundary; only three visual 
bulges have B/T sC 0.4. 

We also consider the possibility that a bias in the visual 
classification scheme causes face-on discs (hidden within the 
bulge component) to be more difficult to identify by eye. 
Such galaxies would be more likely to be classified visually as 
bulges or intermediates. Figure [TTT a) shows the distribution 
of disc inclinations (cos i) determined by GIM2D. The upper 
black line shows the distribution for all visual discs. The lack 
of galaxies at cos i = 0.0 and the spike at cos i ~ 0.2 appear 
to be an artefact from the GIM2D fits that is seen in the 
distribution of the total sample, so it should not be taken as 
evidence for a bias in finding discs. This feature is likely due 
to the fact that a galaxy cannot be perfectly edge-on because 
real discs are not infinitely thin as GIM2D assumes so that 
truly edge-on discs are piled up at cos i ~ 0.2. Above cos i > 
0.3, there does appear to be a slight decline in the number 
of visual discs as cos i increases. A KS test comparing the 
distribution at cos i > 0.3 to a flat distribution yields a 
probability of 3.9%. We conclude from this that there is 
indeed a slight bias against visually identifying face-on discs; 
the effect is statistically significant, but small. 

Figure[l2]cxplores misclassifications as a function of ab- 
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Figure 13. Histograms showing the correlation between GIM2D 
smoothness s2 and the visual classifications FEAT for automated 
discs. Most automated discs with s2 Sj 0.08 are visually smooth 
(red, FEAT = 1), while most automated discs with s2 > 0.08 
are visually unsmooth (solid blue, FEAT 1). The numerical 
results arc tabulated in Table [5] 

solute magnitude by plotting separate color-magnitude dia- 
grams of (a) the automated bulge sample and (b) the auto- 
mated disc sample, with symbols indicating the visual clas- 
sifications in both panels. The discrepant galaxies in both 
automated samples may tend toward fainter magnitudes, 
but the effect is more pronounced for visual discs classified 
as automated bulges (blue bars, panel a). 

3.2 Smooth vs. unsmooth discs 

To study the properties of quenched galaxies in a large sam- 
ple, we would like to be able to identify star-forming con- 
taminants in our sample. Visually these are unsmooth discs 
with FEAT ^ 1 and make up 23 per cent of our total sam- 
ple. In this section, we will show that we are able to use 
GIM2D smoothness s2 to reliably remove star-forming con- 
taminants from future samples using only their automated 
morphological classifications. The following analysis applies 
only to the automated disc sample described above. We use 
the automated discs because in the future, in the absence of 
visual inspection, we will be able to use only the automated 
parameters. 

Figure QjJ] shows the correlation between FEAT and 
s2. The distribution of visual smooth discs (FEAT — 1, 
red line) is centred at a lower value of s2 than visual un- 
smooth discs (FEAT 1, blue solid line). We therefore di- 
vide the sample into two populations, one with low smooth- 
ness parameter (s2 ^ 0.08), which includes half of the vi- 
sual smooth discs, and one with high smoothness parameter 
(s2 > 0.08), which includes most of the visual unsmooth 
discs. The numbers of each visual type falling on both sides 
of the s2 = 0.08 cut are listed in Table [S] The purity of the 
automated smooth disc sample is high (78 per cent), but the 
purity of the automated unsmooth disc sample is rather low 
(49 per cent). The low purity of the automated unsmooth 
disc sample may be due to a bias in the visual classifications. 

As in the previous section, we examine cases where 
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Figure 11. Histograms of GIM2D disc inclination angles of (a) visual discs (BC = 3) and (b) automated discs, (a) The distribution 
of visual disc inclinations indicates a slight bias for finding visual discs at higher inclinations. In an unbiased sample, we expect the 
distribution of cos i to be flat. Above cos i > 0.3, the observed distribution is nearly flat, but there are slightly more edge-on than 
face-on discs. A KS test suggests that the bias exists but is small. A stronger bias is that visual unsmooth discs are much more likely 
to be face-on (FEAT ^ 1) than visual smooth discs (FEAT = 1). (b) Inclination biases in automated discs are smaller. The overall 
distribution as well as the automated smooth (s2 ^ 0.08) and unsmooth (s2 > 0.08) samples have much flatter distributions than the 
corresponding visual samples. GIM2D is thus able to identify discs more uniformly than the visual method and can spot non-smooth 
features at all inclinations. 




Figure 12. Same as Figure [4] but divided according to automated type: (a) the automated bulge sample of 356 galaxies and (b) the 
automated disc sample of 478 galaxies. Comparison with the histograms of Figure [4] reveals that discrepant galaxies in both automated 
samples may tend toward fainter magnitudes, but the effect is more pronounced for visual discs classified as automated bulges (blue 
bars, panel a). 



the two methods disagree. Many visual unsmooth discs 
(BC = 3, FEAT 7^ 1) that are classified as automated 
smooth discs (s2 ^ 0.08) have rather weak features, and 
nearly all have features that are reflection symmetric. Be- 
cause the smoothness parameter is in part a measure of 
asymmetry, it is plausible that a symmetric, face-on spiral 
or a disc with a bar may be featureless by the automated 
criteria. One possible way to check this quantitatively is to 
treat the total residual (Rt) and asymmetric residual (Ra) 
separately, though for simplicity, we do not attempt it here 
because these galaxies make up less than five per cent of our 
sample. 



For the reverse case, visual smooth discs (BC = 
3, FEAT = 1) that are classified as automated unsmooth 
discs (s2 > 0.08), there is a strong tendency for the galaxies 
to be edge-on. This bias is understandable: with the ex- 
ception of dust, which is seen almost exclusively in edge-on 
discs, features are more likely to be picked out visually if a 
galaxy is face-on. In a highly inclined disc the light is pro- 
jected on to a smaller area so it is more difficult to see, by 
eye, a contrast in brightness between a spiral arm or a bar 
and the rest of the light in the galaxy. It seems clear that the 
visual classifications are not spotting all features in highly 
inclined galaxies. 
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Table 5. Visual vs. Automated Smooth and Unsmooth Classifi- 
cations (Automated Discs Only). 



Visual 




Automated 


Completeness 






s2 < 0.08 


s2 > 0.08 




Smooth (FEAT = 1) 




158 


141 


52.8 


Unsmooth (FEAT = 


2) 


38 


90 


70.3 


Unsmooth (FEAT = 


3) 


7 


44 


86.3 


Purity 




77.8 


48.7 





Same as Table [3] but comparing visual and automated smooth 
vs. unsmooth discs. 



This is shown more quantitatively in Figure lllf a). 
where the blue hatched histograms show the disc inclina- 
tion distributions for visual smooth and unsmooth discs. As 
in the previous discussion of inclination bias ( >13.1.2[1 . the 
distribution of cos i should be flat because all inclinations 
are equally likely. In Figure [TTT a). however, the number of 
visual smooth discs increases steadily as the inclination be- 
comes closer to edge-on (cos i = 0), while the number of 
visual unsmooth discs increases steadily as the inclination 
becomes closer to face-on (cos i = 1). The increase is smooth 
and regular, which is consistent with a real bias in identify- 
ing features in inclined discs. 

The behavior of automated smooth and unsmooth discs 
is very different, as shown in Figure lllf b). The distribu- 
tions are much flatter, which may indicate that GIM2D is 
much better than visual classification at finding substruc- 
ture in edge-on discs. With this in mind, we re-examine the 
results shown in Table O restricting attention in the right- 
most columns to face-on galaxies with cos i > 0.8. We see 
that visual features (FEAT ^ 1) are much more common 
in face-on discs than when considering all of the visual discs 
( H2.5I) . The bias against finding edge-on visual unsmooth 
discs may explain the apparent low purity of the automated 
unsmooth disc sample (Table [5]). That is, these galaxies may 
really possess features but the visual method fails to detect 
them. 

Based on the results of this section, 275 star-forming 
contaminants (automated unsmooth discs) can be removed 
from our original sample of 984 using the automated clas- 
sification scheme. Of the remaining 709 quenched galaxies, 
only 67 (9.4 per cent) are visual unsmooth discs. Figure fTTI 
indicates that many of the visual smooth discs have under- 
lying features that are difficult to pick out by eye because 
of inclination effects. This is not a problem, however, for 
the automated method, so future samples that are selected 
using the automated parameters will not be severely con- 
taminated. 

As a final check of how severe residual contamination 
by star-forming galaxies is in our sample, we compare our 
sample of unsmooth discs to galaxies that have the bluest 
GALEX NUV - r colors. As GALEX photometry is not 
available for all SDSS galaxies, we do not want to use it 
in selecting our sample. Nevertheless, the UV photometry 
can serve as a valuable check on the presence of young stars 
for the smaller sample of galaxies presented here. Figure [14] 
shows a CMD of our sample using the GALEX NUV - r 
color. There is an obvious tail of blue galaxies with colors 
that suggest that they are star-forming galaxies. This tail 



is largely absent in the g — r CMD of Figure [4] indicat- 
ing that g — r is not sensitive enough to young stars for 
our purposes. This is true even with improved photome- 
try from GIM2D. In the following, we will compare galaxies 
with NUV — r ^ 4.5 with the unsmooth discs; these are 
indicated in blue in Figure Q3] for both the (a) visual and 
(b) automated classifications. 

In the original sample of 984 galaxies, 158 have NUV — 
r ^ 4.5; 90 of these are visual unsmooth discs and 78 are au- 
tomated unsmooth discs, which suggests that roughly half of 
blue galaxies can by identified morphologically using either 
method. In the remaining sample of 709 nominally quenched 
galaxies, 80 have NUV — r ^ 4.5. If we take all of these 
galaxies as being star-forming, about 10 per cent of the 
'quenched' sample is made up of star-forming contaminants. 
This should be an upper limit, as some of the galaxies with 
NUV — r ^ 4.5 may simply be the tail of the red sequence. 

Based on the above analysis, we believe that approx- 
imately half of the star-forming contaminants can be ef- 
fectively removed by identifying them morphologically. In 
the remaining sections, we analyze only the quenched sam- 
ple, that is, bulges, intermediates, and smooth discs. The 
275 star-forming contaminants identified by their unsmooth 
morphology in this section are not included in the following 
analysis. We also examine whether the remaining blue con- 
taminants - those that were not identified by their unsmooth 
morphology - affect on the result. 



4 MORPHOLOGIES VS. STRUCTURAL 

PARAMETERS USING BOTH METHODS 

Now that both visual and automated classifications are in 
place, the remainder of this paper employs them in a first re- 
connaissance of how the morphologies of quenched early type 
galaxies vary versus galaxy structural parameters. Because 
bulges and discs may have experienced different quenching 
mechanisms, the frequencies of each type as a function of 
various structural parameters will provide constraints for 
quenching theories. In this section, 'discs' refers only to 
non- star forming smooth discs - unsmooth discs (i.e., blue 
contaminants) have been removed using the criteria outlined 
in the previous section. We present some preliminary results 
for our sample of 709 quenched galaxies and also assess the 
degree to which our conclusions depend on which method 
is used. By and large, we find that the two methods agree 
well in a statistical sense, which shows that our automated 
method can be used with confidence for a larger sample to 
examine various scaling relations for bulges and discs sepa- 
rately. 

The size parameters used here are r-band absolute mag- 
nitude M r , stellar mass M,, velocity dispersion a, and 50% 
Petrosian radius Rsq. Absolute magnitudes were derived 
from the model magnitudes provided by the SDSS as de- 
scribed in i|2.3l Stell ar mass measurements are taken from 
iGallazzi et al l l|2005h : it should be noted that 40 galaxies 
have log M* = —99 (i.e., flagged as bad data values) in 
their catalog and are excluded in the M„ histograms. The 
velocity dispersion is a product of the SDSS pipeline (DR6, 
lAdelman-McCarthv et all l200ct ). The velocity dispersions 
used here are not aperture corr ected, but the corre ctions 
are small, of order ~ 6 per cent (|Graves et al.l|2009a| ). The 



Automated Morphological Classification of SDSS Red Sequence Galaxies 17 




Figure 14. Color-magnitude diagram using GALEX NUV — r. The bluest galaxies are likely to be star-forming. The horizontal line 
indicates a rough cut below which we consider galaxies to be contaminants. The blue symbols indicate the galaxies that are unsmooth 
discs (i.e., star- forming) using the (a) visual and (b) automated methods. Either method athes roughly half of all star- forming galaxies 
below the dashed line, leaving a quenced sample that is ~ 90 per cent pure (see text). Our analysis shows that using the morphology to 
identify star-forming galaxies is an effective way to remove contaminants from the quenched sample. 



physical 50 per cent Petrosian radius is calculated using the 
apparent size and the object's redshift. 

All four of these parameters (M r , M*, a, and -R50) are 
commonly used size indicators for early type galaxies, and 
previous studies have already shown that the relative frac- 
tion of bulge-dominated galax i es is greater among larger 
objects dSandage et al. I 1 19851 ; iMarzke et all 1 1994 1 19981; 
iMarinoni et alj 19991 ). A recent paper by van der Wei et al] 
l|2009D showed that massive red objects tend to be rounder 
(i.e., not disc-dominated). We therefore expect to find more 
bulges among brighter and bigger objects. However, re- 
cent stellar population work has revealed different trends 
for stellar population parameters depending on which size 
indicator is used, and in general t rends with a are t he 
sharpest (IGrave s fc Schiavonl l20Qg| ; IGraves et all l2009al lbl; 
IGraves k, Faberll2010h . We are interested in looking for dif- 
ferences in the bulge/disc frequencies as a function of the 
size parameters used here: will a again give the sharpest 
trend? Finally, since a and -R50 present the Fundamental 
Plane nearly face on, we can use the present sample to at- 
tempt a first mapping of bulge/disc frequencies directly on 
the Fundamental Plane. 

Figure [15] compares the visual and automated meth- 
ods directly by overplotting the two distributions for each 
type and size parameter. Visual classifications are plotted 
in gray, while the automated classifications are plotted in 
color. Bulge distributions (top row) agree extremely well in 
both shape and number of galaxies. Disc distributions (bot- 
tom row) also agree well with the possible exception that 
there are fewer small discs in the automated disc sample. 

This can be explained by noting that the objects that 
fall in opposite visual and automated classes (that is, visual 
bulges in the automated disc sample and vice versa) roughly 
follow the same distributions as their parent populations. 
Visual bulges occupy a much wider range in magnitude and 
go to brighter magnitudes than the visual discs. This can 
be seen by looking at the color-magnitude diagrams of the 
automated samples (Figure I12[) . Consequently, visual disc 



contaminants in the automated bulge sample are of small 
or intermediate size, but visual bulge contaminants in the 
automated disc sample can range from very small to very 
large. This has the effect of increasing the number of outliers 
in the automated disc sample while decreasing the number 
of outliers in the automated bulge sample. 

The shapes of the distributions for intermediate galaxies 
(middle row) also agree well, but fewer galaxies overall are 
classed as intermediates using the automated method com- 
pared to the visual method (a 37 per cent decrease, from 
237 to 150). This is not a fundamental discrepancy, as the 
relative numbers in the different automated classes could 
be easily adjusted by moving the classification boundaries 
slightly in Figures [7] [8] and [9] We elect to leave the bound- 
aries where they are for now and defer any changes to future 
work, if needed. 

The summary of numbers in Table 0] suggests that the 
automated intermediate sample contains roughly the same 
numbers of visual bulge and disc contaminants. Further- 
more, our previous attempt to more finely classify inter- 
mediates ( i]2.2[) yielded approximately the same number of 
bulgy and discy intermediates. Finally, the most robust re- 
sult of Table 2] is that pure bulges are rarely confused with 
pure discs but that intermediates are less reliably identified. 

With these points in mind, we abandon the intermediate 
classification altogether and assign half of the intermediate 
sample in each bin to the bulge and disc samples. We will 
refer to these new samples as bulges-f- (bulge sample plus 
half of the intermediate sample) and discs+ (disc sample 
plus half of the intermediate sample). The same procedure 
is followed for both the visual and automated classifications. 
Figure [in] compares the results by overplotting the distribu- 
tions from both methods for each new type. With the in- 
termediates divided in half, the agreement between the two 
classification schemes is now even better. 

Figures [T7] and [TH] show the number and percentage, re- 
spectively, of bulges+ (red) and discs+ (blue, dash-dotted) 
as a function of our four size parameters. The vertical lines 
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Figure 15. Comparison of the r-band magnitude M r , stellar mass M t , velocity dispersion <r, and 50 per cent Petrosian radius R50 
distributions of bulges, intermediates, and smooth discs using visual (gray) and automated (colored) classifications. The agreement 
between the methods is reasonably good for bulges (top) and discs (bottom), but worse for intermediates (middle), consistent with the 
results of [[3] and Table 0] 



indicate the boundaries within which the total number of ob- 
jects in each bin is greater than 20. In Figure[l8]the crossing 
value in each panel is calculated and is shown in the panel 
with an arrow. If there are multiple crossings, the average of 
the first and last crossings within the reliable range is taken. 
A comparison of the crossing values for the visual (top pan- 
els) and automated (bottom panels) methods shows that 
they are in good agreement, with differences less than ~ 0.5 
dex. Varying the bin sizes does not significantly affect the 
crossing values or the agreement between the two methods. 

It should be noted that the trends seem somewhat 
stronger using the visual method because some large visual 
bulges are classed as discs using the automated method. 
This can be explained by the previous observation that 
most bright galaxies are bulges, so errors at the bright end 
are more likely to move objects from bulges to discs. It is 
encouraging, however, that the trends are still clearly ob- 
served using both classification methods, and even the rela- 
tive strengths of the different trends are preserved. 

Finally, Figure [19] shows the bulge+ frequency mapped 
onto a nearly face-on projection of the Fundamental Plane, 



shown for both the (a) visual and (b) automated classifi- 
cations. Looking at trends in the horizontal direction (at 
constant R50), bulges+ clearly dominate at high a, while 
more discs+ are present at low a. This is especially evident 
at the largest values of -R50. No clear trends are seen in the 
vertical direction (at constant a), which may suggest that 
the bulge+ frequency is a stronger function of a than .R50. 
We defer a quantitative analysis of these trends to future 
work with a larger sample of SDSS galaxies. Though the ob- 
served trends are slightly weaker for the automated method, 
there is good qualitative agreement between the two meth- 
ods, which shows that the automated method can be used on 
larger samples in future studies of how morphology depends 
on structural parameters and environment, thereby provid- 
ing new constraints on the relative importance of various 
quenching mechanisms. 

We have repeated the preceding analysis excluding the 
galaxies with NUV — r ^ 4.5 (described in H3.2|l and find 
that the results are qualitatively unchanged. This is because 
the these bluest galaxies are distributed across the range of 
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Figure 16. Same as Figure fT5l but for the new samples bulges+ and discs+. When the intermediate sample is split evenly between the 
bulge and disc samples, the agreement in the distributions of the visual and automated samples is even better than in Figure ITol 



each of our four size parameters, so that while the absolute 
numbers change, the overall trends stay the same. 



5 SUMMARY 

We present a novel visual-based morphological classification 
scheme for red sequence galaxies based on SDSS images that 
is aimed at distinguishing between bulges and early type 
discs. Our interest in morphological information is moti- 
vated by the theoretical notion that mergers destroy stellar 
discs and create bulges. Future studies of bulge/disc fre- 
quencies using the new classification scheme may therefore 
provide a new constraint on the importance of mergers vs. 
other formation pathways in quenching galaxies with a va- 
riety of structural parameters in different environments. 

The major element of the visual scheme uses the sharp- 
ness of the outer light profile to assign a bulge class in- 
dex BC: galaxies with diffuse outer profiles are bulges 
(BC = 1), while galaxies with sharp outer boundaries are 
discs (BC = 3) . In addition, five features (spiral arms, bars, 
clumps, rings, and/or dust) that may indicate the presence 
of a cold disc component are collected into a single FEAT 



index. Two more indices (DIST and INT) measure large- 
scale morphological irregularities that might be caused by 
interactions and mergers. 

The eventual goal of measuring bulge/disc frequencies 
as functions of structural and environmental parameters will 
require more galaxies than can be classified visually. We 
therefore attempt to reproduce the visual scheme using a set 
of four machine-measured parameters. The resulting recipe 
maps concentration C and axis ratio b/a from SDSS together 
with smoothness s2 and bulge fraction B/T from GIM2D 
onto our visual parameters BC and FEAT (Figure [TOl) . 

A final comparison between the visual and automated 
methods are given in Table(4] If the visual classifications are 
regarded as 'truth,' the automated method identifies bulges 
with 75 per cent completeness and 73 per cent purity, and 
it identifies discs with 83 per cent completeness and 70 per 
cent purity. Most errors are one class; only 10 per cent of 
visual BC = 1 bulges are completely misclassified as auto- 
mated discs, and only 8 per cent of visual BC = 3 discs 
are completely misclassified as automated bulges. Eight per 
cent of the entire sample of 984 galaxies has an error of more 
than one class. Plausible explanations are given for many of 
these cases f i]3.1.2[) . 



20 Cheng et al. 



a) 
n 

E 

3 



0) 

E 
z 



140 

120 

100 

80 

60 

40 

20 

140 

120 

100 

80 

60 

40 

20 




Visual Classifications 



4* 



j i 



i '-i 



i M 
[ 

1 1 1 1 1 1 ■ i ■ ■ i 



Automated Classifications 



Lo 



- i 



n 



ec = i 4 2 
ec = 2,3 



h "i 



1 1 1 ■ i 



+ 



tu 



Bulges + 
Sm. Discs + 



J ; I. 



-20-21-22-23 10.010.511,011. 
M, (mag) logM. (M^) 



1.8 2.0 2.2 2.4 0.6 0.8 1.0 1.2 1.4 
loga (km s -1 ) logR^ (kpc) 



Figure 17. The numbers of bulges+ (red) and discs+ (blue dash-dotted) as a function of four measures of galaxy size: r-band magnitude 
M r , stellar mass M», velocity dispersion u, and 50 per cent Petrosian radius Rso- The vertical lines indicate the boundaries within which 
the total number of objects in each bin is greater than 20. Bulges+ tend to be brighter, with larger M*, a and R50 compared to discs+. 



The distribution of the inclinations of fitted discs sheds 
light on possible biases in the visual classifications. In partic- 
ular, it is common wisdom that discs are more easily missed 
in face-on systems, and the cos i distribution for GIM2D 
fitted discs is used to test for missing discs in both methods. 
Apart from an excess in one bin near cos i ~ 0.2 the cos i 
distributions in both methods are quite fiat. There may be a 
slight loss of the most face-on discs using the visual method, 
but the statistical significance of this result is low. 

The inclination distributions of visual smooth vs. un- 
smooth discs show a stronger bias. Discs with features 
[FEAT ^ 1) tend to be strongly face-on, indicating that fea- 
tures are often visually lost in edge-on discs. Moreover, the 
fraction of face-on discs (cos i > 0.8) that show features is 
high (~ 72 per cent) indicating that real early type discs al- 
most always contain bars, arms, clumps, rings, and/or dust. 
This is one of our major results. Since FEAT is evaluated 
completely separately from BC, the high frequency of fea- 
tures in our face-on BC = 3 discs lends further strong sup- 
port to our main criterion for identifying discs based on the 
sharpness of their outer light profiles. 

Since the goal of this project is to study quenched galax- 



ies, it is important minimize contamination by star-forming 
objects. We seek to do this by using only parameters widely 
available for SDSS galaxies in order to keep future sam- 
ples large. We demonstrate that roughly half of all star- 
forming contaminants can be identified using either the vi- 
sual or automated methods. Specifically, if unsmooth discs 
are equated to star-forming galaxies and are removed using 
the automated method, the resulting sample of 709 galaxies 
contains only 67 (9.4 per cent) visual unsmooth discs and 
only 80 (11 per cent) UV-blue galaxies. The main limitation 
for both methods is the angular size of the galaxy. When 
galaxies are too small, they are difficult to classify by eye 
and the automated parameters become less reliable (jjA). 

The method has been developed and tested on a sample 
of relatively low-z galaxies (z ~ 0.06) from the SDSS and can 
readily be used on future samples of large (_Rgo > 14"), low-z 
galaxies from the SDSS. A very large sample can be obtained 
by relaxing some of the criteria used here, for example, no 
longer requiring that the galaxies fall in the GALEX DR2 
footprint. 

The method has not been tested at high z, though it 
is likely to be useful out to z ~ 1, where many galaxies 
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Figure 18. The percentages of bulges+ (red) and discs+ (blue dash-dotted) as a function of four measures of galaxy size: r-band 
magnitude M r , stellar mass M», velocity dispersion a, and 50 per cent Petrosian radius R50. The vertical lines indicate the boundaries 
within which the total number of objects in each bin is greater than 20. The crossing value is indicated in each panel with an arrow. 
The general trends and crossing values agree well using both methods. 



have similar morphologies to low-z samples (e.g., lLotz et alj 
l200Sl ) and where images from the Hubble Space Telescope 
can provide similar physical resolution scales as those ob- 
tained by SDSS for our sample. It is likely to break down 
beyond z > 1, where galaxy morphologies may become sig- 
nificantly different from those observed at low redshift (e.g., 
iGenzel et all l200Sl : Ivan Dokkum et aljlioosl) . Further test- 
ing of the sort done in this paper will need to be done to 
apply the method to samples that are significantly different 
from the one used here. 

$4] conducts a preliminary exploration the distributions 
of bulges and discs using four different measures of galaxy 
size. Agreement between visual and automated methods is 
improved by assigning half of the intermediate class ran- 
domly to the bulge and disc samples (called bulges+ and 
discs+ respectively). In agreement with previous results, we 
find that bulges+ are in general brighter and have larger M* , 
a, and -R50 than discs+. The agreement between methods is 
quite good, though the trends appear to be slightly stronger 
when using the visual classifications. Figure [T9l shows a first 
attempt to map the distribution of bulges and discs directly 



onto the Fundamental Plane. Trends with a may be stronger 
than with _Rso in the plane, but this will have be checked 
with larger samples. 



We have presented a method to classify 'red and dead' 
galaxies in the SDSS using automated parameters from the 
galaxy fitting package GIM2D and the photometric pipeline 
of SDSS. In contrast to earlier automated methods, which 
typically distinguish only between early and late type galax- 
ies, our method classifies early type galaxies into bulge- and 
disc-dominated classes. The agreement between our visual 
and automated classifications is quite good, with < 10 per 
cent of our sample in complete disagreement. Furthermore, 
the distributions of the different morphologies in galaxy 
'size' are also reproduced reasonably well by the automated 
scheme, which can now be applied to a larger sample of 
SDSS galaxies to explore how morphology varies with struc- 
tural parameters, environment, UV flux, and other galaxy 
properties. 
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Figure 19. The bulge+ frequency mapped on to the Fundamental Plane using the (a) visual and (b) automated classifications. Red 
indicates a bin comprised solely of bulges+, while blue indicates a bin comprised solely of discs+. Small squares indicate bins that contain 
fewer than four galaxies, with the size indicating the number. The black lines indicate the locations of the vertical lines in Figures 1171 
and 1181 At high a the sample is clearly dominated by bulges+, while at low a there are many more discs+. The trend in _Rso may be 
somewhat weaker. Again, there is good qualitative agreement between the two methods. 
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(C ~ 3.5) should also have the highest bulge fractions B/T. 
Instead, we see that they have intermediate bulge fractions 
0.5 < B/T < 0.6 with a slight downturn in C at higher 
values of B/T. This occurs because the GIM2D fits that we 
are using require that the bulge component have a Sersic 
index n = 4, which is not always a good fit, as many giant 
ellipticals are known to have n > 4. These galaxies may have 
more light in the wings than an n = 4 profile can account 
for, and a disc is added to improve the fit. The fraction of 
light attributed to the disc is thus higher, and the bulge 
fraction is underestimated. 

We have investigated this effect by analyzing analogous 
GIM2D r-band fits where n is treated as a free parameter. 
These fits result in a B/T-C relation that is not peaked at 
intermediate values of B/T as in Figure [5] but rises contin- 
uously and gradually to B/T — 1. While the value of B/T 
changes for many galaxies, most of the galaxies for which 
B/T changes significantly have B/T > 0.6 and do not fall 
near the boundary. For visual discs with high B/T (BC = 3, 
B/T > 0.6), the distribution of Sersic indices peaks between 
n — 3 and n = 4, which indicates that assuming an n = 4 fit 
is reasonable for these galaxies. Overall, the results of our 
automated classifications using the two fits are similar and 
the choice of n = 4 versus floating-n fits has no significant 
effect on the results presented here. 



APPENDIX A: MORE ON THE VALIDITY OF 
THE GIM2D AND SDSS PARAMETERS 

Figure [5] shows that the most bulge-dominated galaxies also 
tend to be the most concentrated. We note, however, several 
features that bear more examination. 

Al GIM2D Bulge Fraction B/T 

The first notable feature in Figure[9]is the outliers with high 
values of C. We expect that the most concentrated objects 



A2 SDSS Concentration C 

The second notable feature in Figure [9]is the group of galax- 
ies that appears to fall below the main B/T-C relation, in 
the automated intermediate region. These generally prove 
to be small and faint. Figure lATT a) shows SDSS concentra- 
tion C as a function of SDSS 90 per cent Petrosian radius 
7?90 (i.e., galaxy apparent size) for the initial sample of 1295 
objects with color indicating absolute physical size. We use 
the full sample of 1295 because we are interested in how the 
apparent size of the galaxy affects the value C, especially 
for the small galaxies removed by the 'smallness' cut at log 
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Figure Al. Distributions of SDSS concentration C. (a) SDSS 
90 per cent Petrosian Radius vs. SDSS concentration for 1295 
galaxies, with color indicating absolute physical size. The vertical 
dotted line marks our adopted 'smallness' cut at log Rgo = 1.15 
f i]2,2|l . while the horizontal dotted line marks the C = 2.9 cut used 
in the automated scheme in Figure [9] For small values of R90 , 
there appears to be a ceiling to the value of C; as the apparent 
size of the galaxy decreases, the maximum value of C decreases 
as well. The smallness cut screens out most such objects, (b) 
SDSS axis ratio vs. SDSS concentration for 1295 galaxies, with 
color indicating GIM2D bulge fraction B/T. Three linear fits to 
different B/T bins are shown. The vertical dotted line indicates 
the b/a = 0.45 cut used in the automated scheme in Figure [8] 
The measured value of C appears to behaved differently as a 
function of b/a for bulges and discs. In particular, edge-on discs 
(low b/a) have higher concentration C that face-on discs (high 
b/a) with comparable B/T . This effect has also been noted by 
lYamauchi et al.l (120051) and lBailin fc Harris! teOQgt) . We conclude 
from these panels that the value of C is least reliable for the 
smallest and most elongated galaxies. 



Rgo = 1.15 in EI2.2I This cut is indicated by the vertical dot- 
ted line, while the adopted boundary between automated 
bulges and intermediates at C = 2.9 is indicated by the 
horizontal dotted line. For large apparent size, the low C 
galaxies tend to be large face-on discs, based on the visual 
classifications in Figure [9] For small apparent size, there ap- 
pears to be a ceiling to the value of C; as the apparent size of 



the galaxy decreases, the maximum value of C decreases as 
well, which is logical, as seeing smooths out the central peak 
in high-concentration galaxies. Furthermore, this maximum 
is not a strong function of the galaxy's absolute physical size. 

The last notable feature in Figure[9]is the large number 
of visual discs with rather high values of C, even at low val- 
ues of B/T. We find that these are often galaxies with elon- 
gated appearances (i.e., edge-on discs). Figure lATT b) shows 
SDSS concentration C as a function of SDSS axis ratio b/a 
for the same sample, with color indicating GIM2D bulge 
fraction B/T. The vertical dotted line indicates the value 
of b/a below which objects are classified as automated discs 
in Figure [5] while the horizontal dotted line indicates the 
value of C below which objects are classified as automated 
intermediates in Figure U] 

Two trends are seen here. First, the lack of galaxies 
with high C and low b/a is consistent with the idea that 
bulge-dominated galaxies (high B/T, red) are both round 
and centrally concentrated. Second, the measured value of 
C changes systematically for discs (low B/T, blue): edge- 
on discs (low b/a) have higher concentration C than face- 
on discs (high b/a) with comparable B/T. This is illus- 
trated with linear fits to galaxies with different values of 
B/T (indicated by col or) . This effect, as no t ed in work by 
lYamauchi et al.l (120051 ) and iBailin fc Harris! (120081 ), occurs 
because Rgo is measured using circular apertures. We con- 
clude from Figure [ATI that the value of C is least reliable for 
both the smallest and the most elongated galaxies. 

Our automated classifications are unlikely to be affected 
greatly by either effect. The former effect appears to be 
strongest for galaxies log Rgo ^ 1.15, all of which were elim- 
inated from the final sample. There may be a slight effect on 
small galaxies near the boundary, which may have concen- 
trations ~ 0.2 less than their 'true' values. As for the latter 
effect, the most elongated galaxies (b/a ^ 0.45) are already 
selected to be in the automated disc sample based on their 
axis ratios and will not contaminate the automated bulge 
sample even though they have high values of C. 
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